Summary:
There is a strong need for synthetic yet realistic distribution system test data sets that are as diverse, large, and complex to solve as real systems. Such data sets can facilitate the development of advanced algorithms and the assessment of emerging distributed energy resources while avoiding the need to acquire proprietary critical infrastructure or private data. Such synthetic data sets, however, are useful only if they are realistic enough to look and behave similarly to actual systems. This paper presents a comprehensive framework for validating synthetic distribution data sets using a three-pronged statistical, operational, and expert validation approach. It also presents a set of statistical and operational metric targets for achieving realistic data sets based on detailed characterization of more than 10,000 real U.S. utility feeders. The paper demonstrates the use of the proposed validation approach to validate three large-scale synthetic data sets developed by the authors representing Santa Fe, New Mexico; Greensboro, North Carolina; and the San Francisco Bay Area, California.
Keywords: Electric distribution test feeders, synthetic data sets, validation, statistical metrics, power flow, smart grid use case.
JCR Impact Factor and WoS quartile: 8,960 - Q1 (2020); 8,600 - Q1 (2023)
DOI reference: https://doi.org/10.1109/TSG.2020.2981077
Published on paper: September 2020.
Published on-line: March 2020.
Citation:
V. Krishnan, B. Bruce, T. Elgindy, C. Mateo, P. Dueñas, F. Postigo Marcos, J.S. Lacroix, T. Gómez, B. Palmintier, Validation of synthetic U.S. electric power distribution system data sets. IEEE Transactions on Smart Grid. Vol. 11, nº. 5, pp. 4477 - 4489, September 2020. [Online: March 2020]